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Abstract 

In  logic  programming  the  unification  process  involves  reading  the  entire  term 
from  the  beginning  up  to  the  place  where  the  needed  information  is  stored  (cf. 

[1]) .  It  is  clear  that  a  reasonable  system  with  the  direct  access  to  the  information 
might  be  more  efficient.  For  example,  a  binary  tree  of  data  is  usually  presented 
by  a  term  and  the  standard  PROLOG  algorithm  even  when  it  knows  the  path 
leading  to  the  desired  node  is  forced  to  read  the  term  symbol  by  symbol  in  order 
to  understand  the  subtrees  structure  of  the  term.  However  if  we’ll  teach  this 
algorithm  how  to  utilize  the  information  about  the  addresses  of  the  immediate 
successor  nodes,  which  is  in  fact  stored  on  the  microlevel  together  with  each 
node,  then  the  algorithm  will  read  only  log  of  the  length  of  the  term.  To 
provide  data  structures  with  the  direct  access  we  have  to  connect  a  PROLOG- 
like  language  with  some  external  system  that  will  support  the  descriptions  of 
such  structures.  We  consider  the  principles  of  designing  such  external  systems 
based  on  the  ideas  and  methods  of  labelled  modal  logic. 

We  introduce  reference  structures  -  a  basic  mathematical  model  of  a  data 
organization  capable  to  store  and  utilize  information  about  its  addresses.  A 

*  Supported  by  the  ARO  under  the  MURI  program  “Integrated  Approach  to  Intelligent  Systems”, 
grant  no.  DAA  H04-96- 1-0341. 

1  Partially  supported  by  the  grant  No.95-01-00416  of  the  Russian  Foundation  for  Fundamental 
Research. 
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propositional  labelled  modal  language  is  used  as  a  specification  and  program¬ 
ming  language  for  reference  structures;  the  satisfiability  algorithm  for  modal 
language  gives  a  method  of  building  and  optimizing  reference  structures  sat¬ 
isfying  a  given  formula.  Corresponding  labelled  modal  logics  are  presented, 
supplied  with  cut  free  axiomatizations,  completeness  and  decidability  theorems 
are  proved.  Initialization  of  typed  variables  in  some  programming  languages  is 
presented  as  an  example  of  a  reference  structure  building. 


1  Introduction 

We  suggest  to  interpret  a  labelled  modal  formula  |m]A  as  “memory  cell  m  stores 
sentence  A”  and  to  treat  propositional  variables  as  names  of  the  cell  contents.  The 
labelled  modal  language  allows  to  keep  control  over  both  unification  of  names  and 
validity  of  the  information  stored. 

All  this  eventually  makes  it  possible  to  do  some  sort  of  programming  of  referential 
data  structures  by  means  of  labelled  modal  language  in  the  following  way. 

We  consider  a  language  with 

•  atomic  data  constants  Ci,  C2, . . ., 

•  variables  mi,  m2, . . .  for  memory  cell  addresses, 

•  operation^of  reading  the  contents  of  a  cell,  operation  !•](•)  for  storing  informa¬ 
tion  to  a  cell,  boolean  connectives. 

A  formula  in  this  language  may  be  regarded  as  a  specification  of  a  memory  config¬ 
uration  which  stores  data  files  ci,C2, . . .  together  with  an  information  about  contents 
of  other  cells,  location  of  files,  etc.  The  standard  completeness  and  cut  elimination 
proof  of  a  corresponding  logic  of  refence  structures  in  fact  gives  an  algorithm  which 
verifies  the  unifiability  of  names  and  semantical  correctness  of  this  specification  and 
in  a  positive  case  provides  a  data  allocation  table  in  abstract  addresses. 

The  compiling  problem  turns  out  to  be  AP-complete.  The  corresponding  algo¬ 
rithm  suggested  in  the  current  paper  is  a  hybrid  of  the  unification  and  some  sort  of 
boolean  satisfiability  procedures. 

The  restriction  of  the  underlying  objects  to  sentences  (with  validity  relation  on 
them)  does  not  lead  to  a  loss  of  generality  for  our  purposes:  if  a  proper  data  c,- 
originally  represents  a  number  N,  we  assume  that  c,-  is  the  sentence  “this  is  a  number 
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iV”;  the  same  treatment  may  be  given  to  other  sorts  of  proper  information:  terms, 
names,  addresses,  etc. 

The  general  definition  of  a  reference  structure  covers  not  only  a  wide  class  of 
computer  data  organizations,  but  also  cross-references  with  built  in  reference  assign¬ 
ments  in  formal  languages,  the  system  of  proofs  and  theorems  in  a  formal  theory, 
etc.  However  in  the  current  paper  we  restrict  this  general  definition  to  pure  reference 
structures  closely  oriented  to  the  computer  data  bases.  Since  there  will  be  no  others 
here  we  will  use  a  general  name  reference  structures  for  the  pure  ones. 

2  Reference  Structures 


2.1  Definition.  The  language  £(M,  C)  of  a  reference  structure  depends  on  two  sets 
M  and  C  and  is  defined  as  follows.  Let  C  =  {ci,  Ca, . . . ,  T}  be  a  set  of  data  constants, 
which  will  represent  a  proper  information  to  be  stored  in  a  reference  structure.  Let 
also  M  =  {1, 2, . . .}  be  a  set  of  memory  cells.  The  language  of  a  reference  structure 
contains  storage  operators  [2l(-)5  •  •  •>  memory  cell,  together  with 

usual  boolean  connectives  {A,V,“',  T}.  For  any  cell  m  E  M  there  is  a  reference 
variable  u„i,  =  m  for  short.  One  should  not  be  mislead  by  the  notation:  1,2... 
are  indeed  variables,  not  constants,  since  the  reading  function  corresponding  to  ^  will 
itself  be  a  parameter  of  a  reference  structure.  We  denote  by  V  the  set  of  all  reference 
variables.  The  set  of  formulas  Fm[M,  C)  is  the  least  set  such  that 

C,V  C  Fm{M,C), 

if  m  6  M  and  A  €  Fm{M,  C),  then  [m]A  €  Fm{M,  C), 
a  A,  BE  Fm{M,  C),  then  {A  A  B),  {A  V  B),  {A  -)•  B),  (-nA)  G  Fm{M,  C). 


2.2  Definition.  A  formula  is  ground  if  it  contains  no  reference  variables,  under 
St{M,C)  we  mean  the  set  of  all  ground  formulas  of  the  language  C{M,C).  A  sub¬ 
stitution  is  a  partial  mapping  6  :  V  — >  Fm{M,C)‘,  ^  is  a  solution  of  an  equation 
A  =  B,  for  A,B  E  Fm{M,  C),  if  A0  =  B9.  Substitution  0  is  a  solution  of  a  relation 
RC  M  X  Fm{M,  C)  if  0  is  a  solution  of  m  =  A  for  every  (m.  A)  E  R.  A  substitution 
9  :  V  — y  St(M,  C)  is  called  ground  substitution.  We  assume  that  all  atom  constants 
are  valid.  Any  ground  solution  ^  of  a  relation  R  Q  M  x  Fm{M,  C)  naturally  defines 
a  validity  relation  g  on  all  ground  formulas: 
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^  ^  C, 

{m  £  Dom  R  and  mO  =  A), 

1=^  g  respects  boolean  connectives. 

A  ground  solution  ^  of  a  relation  R  is  valid  on  iV  C  M,  if  §  fi6  for  all  n  €  A.  A 
storage  table  is  a  functional  relation  RC  M  x  Fm{M,  C)  between  memory  cells  and 
formulas. 


2.3  Definition.  A  reference  structure  is  a  storage  table  which  has  a  ground  solution 
6  valid  on  Dom  R,  i.e.  a  storage  table  with  all  stored  sentences  to  be  true. 


2.4  Comment.  The  relation  R  is  a  system  of  assigning  memory  cells  to  formulas 
from  Pm{M,  C)  which  is  consistent  from  both  combinatorial  and  semantical  sides. 
The  cells  which  are  not  in  Dom  R  are  called  empty.  The  reserve  of  empty  cells  is 
both  realistic  and  technically  convenient.  If  R  has  a  ground  solution  satisfying  the 
definition  of  a  reference  structure  above,  then  R  has  such  a  solution  which  is  total  on 
V.  Without  loss  of  generality  we  assume  that  0  is  already  total  and  call  it  a  reading 
procedure  of  R.  A  reading  procedure  provides  a  ground  picture  of  the  cell  contents 
where  all  the  references  are  already  given  their  “real”  meaning  in  terms  of  proper 
information  and  storage  connections.  On  empty  cells  a  reading  procedure  returns 
some  ground  sentences  which  may  be  regarded  as  sort  of  “error  messages” . 

2.5  Definition.  Let  i?  be  a  reference  structure  and  6  -  its  reading  procedure. 
With  a  pair  3?  =  {R,  9)  we  associate  a  validity  relation  [=  defined  on  all  formulas  from 
Pm{M,C): 

3?  [=  A  ■4=^  ^R,e 

It  is  easy  to  see  that  |=  is  an  extention  of  the  “old”  validity  relation  g  from 
StiM,  C)  to  Fm(M,  C).  Also,  3?  f=  ^  for  all  A  €  Val  R. 

2.6  Lemma. 

3?  [=  |m]A  =>  3?  1=  A. 


Proof  follows  immediately  from  the  definitions. 
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2.7  Lemma.  The  following  are  equivalent 

1.  m  is  nonempty, 

2.  ^  \=  \rn\B  for  some  formula  B, 

3.  |m]m. 

Proof.  3.  2.  1.  are  trivial.  We  prove  the  remaining  1.  ^3.  If  m  €  Dom  R, 

then  (m,  A)  E  R  for  some  formula  A  mO  =  AO,  hence  \=j^  0  [m|A0  and  3?  |=  |m]A.  ■ 

We  can  see  now  how  the  decision  to  have  a  reserve  of  empty  cells  increases  the 
expressive  power  of  the  reference  structures  language.  For  example,  the  fact  that 
m  G  M  is  not  empty  can  now  be  expressed  by  a  formula  [mjm,  which  we  will  denote 
m  and  will  use  as  a  natural  sentence  format  pointer.  The  meaning  of  m  as  a  pointer 
is  assumed  to  be  built  in  the  search  algorithm.  Note  that  the  length  of  m  can  be 
easily  made  of  the  order  of  the  length  of  m  and  m.  i.e.  ’’very  small”. 

2.8  Example.  A  list  of  ci,  C2, . . . ,  c„  may  be  described  as  the  ground  reference  struc¬ 
ture  R  over  M  =  {1, 2, . . . ,  n  -f- 1}  and  C  =  {ci,  C2, ...,  c„}  as 

R  =  {(!>  ^l)^  Tn),  (n  +  1,  T)} 

for  (pi  =  Ai  A  [2|(;i'2,  (p2  =  A2  A  [3]v?3, . . . ,  A  [n  +  1]T.  Here  T  works  as  a 

marker  of  the  end  node.  The  list  can  be  represented  by  the  formula 

It  does  not  mean,  however  that  we  intend  to  store  the  entire  list  in  one  cell  1.  We 
will  see  now  how  a  regular  reference  structure  ’’list”  looks  like: 

R  =  {{i,Ai  A  2), (2,^2  A  3),..  .,(n,  An-t-  l),(n  A  1,T)}. 

The  entire  reference  structure  can  now  be  represented  by  the  formula 

[ll(Ai  A  2))  /\ . . .  /\  In](A„  A  n  -t- 1)  /\  [n  +  IjT . 


The  main  question  here  is  how  to  decide  whether  there  exists  a  reference  structure 
satisfying  given  storage  description,  and  to  construct  one  if  it  exists.  A  finite  equation 
system  alone  can  be  solved  in  linear  time  (cf.[5],  [6]).  The  semantic  component 
however  spoils  the  picture:  the  problem  immediately  becomes  at  least  iVP-hard, 
since  it  naturally  includes  the  satisfiability  problem  for  the  classical  propositional 
logic.  Below  we’ll  show  that  it  is  AP-complete. 
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3  Logic  of  reference  structures 

Usually  the  Unification  Algorithm  deals  with  finite  systems  of  “unconditional”  equal¬ 
ities  of  the  form  A  =  B.  Fast  algorithms  of  solving  such  systems  were  suggested  in 
[5]  (cf.  also  [7]).  We  assume  that  formulas  are  presented  as  directed  acyclic  graphes 
with  shared  variables  (dags)  which  allow  lineartime  unification  ([5]). 

We  will  also  be  interested  in  the  “conditional”  equalities  of  the  form 

/  Ui  =  Vi,  iel  .jx 

\  Sj  =  Wj  =^>  Ui  =  j€J  ^  ’ 

For  a  convenience  we  consider  some  deterministic  variant  of  the  Unification  Algo¬ 
rithm  by  fixing  an  order  of  the  equations  for  this  algorithm  to  choose.  The  suitable 
modification  U  of  the  unification  algorithm  for  “conditional”  equalities  works  as  fol¬ 
lows.  Using  the  standard  unification  algorithm  solve  the  unconditional  part  of  the 
system  and  calculate  its  m.g.u.  cr.  Then  pick  a  “conditional”  equality  and  check  the 
conditions 

SjCT  =  Wja. 

If  the  condition  fails,  then  take  the  next  “conditional”  equality.  If  the  conditions  are 
fulfilled,  add  the  succedent  equality  to  the  unconditional  part  and  solve  the  system 
again.  The  process  terminates  when  the  checking  procedure  fails  to  add  new  equalities 
or  the  unification  algorithm  fails  to  solve  a  current  unconditional  part  of  the  system. 
The  standard  argument  proves  that  this  modification  gives  the  most  general  unifier 
(m.g.u.)  of  the  system  with  “conditional”  equations.  The  standard  m.g.u.  of  the  set 
of  equations  (1)  is  the  m.g.u.  obtained  by  U. 

3.1  Lemma,  (cf.  [3]).  Let  a  be  the  standard  m.g.u.  of  a  “conditional”  system  (1). 
Then 

1 .  all  variables  occurring  in  cr  are  from  (1), 

2.  Dom{cr)  D  Val{cr)  =  0, 

3.  <T  is  idempotent,  i.e.  cr  o  a  =  a, 

4.  for  every  solution  9  of  (1)  there  exists  a  substitution  r  s.t.  0  =  cr  0  t. 

Consider  a  labelled  modal  language  L  which  contains 
memory  cell  variables  CVar  =  {mi,  m2,  m3, . . .}, 
reference  variables  RVar  =  {mi,m2,m3 . . .}, 


6 


sentence  constants  Con  =  {ci,  C2,  C3, . . the  truth  constant  T 

and  is  closed  under  boolean  connectives  and  labelled  modalities  =  1,2,... 

(unary  operators). 

The  difference  between  L  and  £(M,  C)  is  that  the  cell  addresses  in  L  are  variables, 
unlike  £(M,  C),  where  they  are  constants. 

3.2  Definition.  Let  M  be  a  memory  set  and  C  a  data  constants  set.  An  interpre¬ 
tation  of  L  to  Fm{M,  C)  is  a  mapping  *  of  CVar  into  M  and  Con  into  C  which  is 
injective  on  Con.  The  interpretation  *  has  a  canonical  extension  to  all  L  fromulas: 

T*  =  T, 

for  p  6  i2Var  p*=p*, 

*  commutes  with  the  boolean  connectives, 

(MA)*  is  [piA*. 

We  say  that  a  L  formula  F  is  valid  in  a  reference  structure  5ft  =  (R,  0)  under  interpre¬ 
tation  *,  if  ^  F*.  A  reference  structure  5ft  is  a  model  of  a  given  set  F  of  Z  formulas 
under  given  interpretation  ♦  if  5ft  |=  A*  for  each  A  €  F. 

The  language  L  may  now  be  regarded  as  a  programming  language  for  reference 
structures.  Here  a  program  is  a  modal  formula  A  describing  the  properties  of  a 
reference  structure  5ft.  Satisfiability  of  A  means  the  existence  of  a  desired  reference 
structure.  The  satisfiability  algorithm  for  the  language  L  naturally  arises  from  the 
completeness  proof  of  the  calculus  CR,  (below). 

A  substitution  on  the  L  formulas  works  simultaneously  in  two  formats:  cells  and 
sentences.  No  special  restrictions  on  substitutions  are  imposed.  For  example,  an 
reference  variable  can  be  substituted  by  any  L  formula. 

Without  a  loss  of  generality  we  restrict  the  set  of  cell  variables  CVar  to  its  finite 
fragment  {mi,m2, . . .  ,mr}  (corresponding  restriction  should  be  put  on  the  set  of 
reference  variables).  Also  we  assume  that  Con  is  finite. 

3.3  Definition.  Under  0-^,3, p  we  mean  the  standard  m.g.u.  of  the  set  of  equations 

P  =  _  (2) 

m,-  =  mj  =>  m,-  =  nij. 

Here  the  “conditional”  part  is  standard  with  mi,mj  range  over  all  cell  variables 
occurring  in  “unconditional”  part  p  =  A  =  B.  Note  that  (Ta,b,p  is  an  idempotent  and 
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acts  on  the  variables  of  all  sorts,  mi(TA,B,p  is  a  cell  variable  and  mi(TA,B,p  is  a  formula 
from  L. 

3.4  Definition.  We  define  C  —  D  (mod  p  =  A  =  B)  to  stand  for 

“(7<t  =  Da  for  every  solution  a  of  (2)”. 

Apparently,  if  the  system  (2)  has  no  solution,  then  C  =  D  (mod  p  =  A  =  B) 
holds  for  all  C  and  D.  If  the  system  (2)  has  a  solution  then 

C  =  D  (mod  p  =  A  =  B)  ^  CaA,B,p  =  DaA,B,p- 

So,  the  relation  C  =  D  (mod  p  =  A  =  B)  is  decidable. 

Axioms  of  CTt: 

(Al)  The  classical  propositional  axioms  together  with  constants  {ci,C2,C3, ...,T} 
adopted  as  new  axioms, 

(A2)  IpIA  ^  A, 

(A3)  [pJA  A  MB  ^{C^D)  if  (7  =  D  (mod  p  =  A  =  B). 

Rule  modus  ponens. 

Axiom  (A3)  is  similar  to  the  unification  axiom  from  [2]  and  the  functionality  axiom 
from  [4]. 

3.5  Example.  The  following  is  provable  in  LIZ: 

•  -([pil^i  A  ...  A  if  the  system 

{Pk  =  Ak  (fc  =  l,...,n); 

p  =  q  p  =  q  for  all  cell  variables  p,  q  (3) 

occurring  in  [pjAi  A  ...  A  |[p„]A„ 

is  not  unifiable. 

•  [piJAi  A  ...  A  [pnjAn  {B  C)  if  Ba  =  Ca  for  the  most  general  unifier  a 
satisfying  the  condition  (3). 
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4  Completeness  theorem 


4.1  Lemma.  For  any  modal  formula  F  if  CR,  h  F ,  then  F*  is  valid  under  every 
interpretation  *  in  reference  structures. 

Proof.  A  straightforward  induction  on  the  proof  of  F.  ■ 

4.2  Theorem.  For  any  formula  F  E  L  ifCTZ  \f  F,  then  there  exists  a  finite  reference 
structure  3?  and  interpretation  *  of  the  language  L  into  3?  such  that  3?  ^  jP*. 

Now  we  introduce  a  Gentzen  style  formulation  of  C7i  and  prove  simultaneously 
the  completeness  theorem  along  with  the  cut  elimination  property  of  the  relevant 
Gentzen  style  system. 

In  what  follows  a  sequent  is  a  formal  expression  of  the  form  F  D  A,  where  F  and 
A  are  finite  sets  of  L  formulas. 

4.3  Definition.  CKq  is  the  following  sequent  calculus: 

Axioms: 

•  F  D  A  such  that  FnA^0orT  GAorcGA  for  some  c  E  Con. 

•  F  D  A  such  that  E  C  F  ,  where  H  =  |  i  =  1, 2, . . .}  and  the  system  (3) 

for  H  is  not  unifiable. 


Rules: 


•  Classical  rules  for  A,-'  and  structural  rules  together  with  the  cut-rule. 

•  A,r  D  A 
|[p|A,F  D  A 


S,  F  D  A  E,  F  D  A  ,  „  rir  -n  ..  i  •  i  o  i  j 

E  BTd  A  ’  s,  F"5  ^  {WA-  h  =  1,2, . . .}  and  (7 

is  the  most  general  unifier  of  (3)  for  E  and  obtained  as  a  result  of  the  standard 
unification  algorithm  U. 


4.4  Definition.  C'JZq  is  the  system  CTZa  without  the  cut  rule. 

The  following  lemma  claims  the  soundness  of  C'R.g  w.r.t.  CTZ. 

4.5  Lemma.  If  CR-q  h  F  D  A,  then  CR  h  A  T  ->•  V^-  ^ 

Proof.  Standard  induction  on  the  complexity  of  the  proof  of  F  D  A  in  CR,g-  ■ 

4.6  Definition.  Saturation  process  is  the  nondeterministic  procedure  constructing 

a  saturation  tree  labelled  by  pairs  (sequent,  substitution)  as  follows; 

Given  the  sequent  Fq  D  Ao  put 

Fq  =  Fo  U  {T}  U  {  the  set  of  all  constants,  occurring  in  Fo  D  Ao}, 

and  label  the  root  by  (Fq  3  Aq,  e),  where  e  is  an  empty  substitution,  and 
try  repeatedly  to  apply  the  saturation  rules  while  they  add  to  the  tree 
some  node  with  the  label  sequent  different  from  the  label  of  its  parent. 

The  rules  can  be  applied  to  an  arbitrary  leaf  of  the  current  part  of  the  tree 
if  its  label  sequent  F  D  A  is  not  an  axiom  of  CHg]  in  the  formulations  of 
the  rules  we  suppose  that  such  a  leaf  (a  current  node)  is  already  chosen 
and  labelled  by  (F  D  A,  cr). 

Saturation  rules: 

Rule  1.  If  A  A  R  G  F,  then  add  to  the  tree  a  son  of  the  current  node 
labelled  by  (F  U  {A,  B}  D  A,  <r). 

Rule  2.  If  A  A  R  €  A,  then  add  to  the  tree  two  sons  of  the  current  node 
labelled  by  (F  D  A  U  {A},  <t)  and  (F  3  A  U  {R},  <t). 

Rule  3.  If  ->A  €  F  (-lA  G  A),  then  add  to  the  tree  a  son  of  the  current 
node  labelled  by  (F  3  A  U  {A},<r)  (correspondingly,  (F  U  {A}  3 
A,o-)). 

Rule  4.  If  DpA  G  F,  then  add  to  the  tree  a  son  of  the  current  node 
labelled  by  (F  U  {A}  3  A,  (t). 

Rule  5.  Call  the  unification  algorithm  U  to  get  the  most  general  solution 
cr'  of  the  system  (3)  where  {|[p,]Ai,i  =  l,...,n}  is  the  list  of  all 
formulas  of  the  form  |[p,|A  from  F.  Add  to  the  tree  a  son  of  the 
current  node  labelled  by  (Fcr'  3  A<t',  <Tcr'). 

iPor  n  =  {Ai,A2,  . . .}  A^^  =  ^1  A  Aj  A  . . .,  and  =  Ai  V  A2  V  . . .. 
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4.7  Lemma.  If  (F  D  A,  cr)  is  a  label  in  a  saturation  tree,  then  for  any  variable  v 
occurring  in  T  D  A  we  have  vcr  =  v. 

Proof.  First  of  all  we  notice  that  none  of  the  variables  from  Dom{a)  occurs  in  Val{cr) 
since  cr  is  a  product  of  m.g.u.’s  each  enjoying  the  properties  of  lemma  3.1.  Consider  a 
step  5.  Any  variable  v  occurring  in  F  D  A  is  neither  from  Dom{cr)  nor  from  Dom[cr'). 
Thus  u  is  a  fixed  point  of  both  a  and  a'.  ■ 

4.8  Corollary.  For  any  label  {T  D  A,  a)  of  the  saturation  tree  any  subformula  A  of 
T  D  A  we  have  Aa  =  A,  (hence  Fcr  =  F  and  Acr  =  A). 

4.9  Lemma,  cr^  =  cr. 

Proof.  Dom{a)  D  Val{a)  =  0.  ■ 

4.10  Lemma.  The  saturation  process  terminates. 

Proof.  Rules  1-4  do  not  change  the  subformulas  of  the  sequent  so  they  can  not 
be  applied  infinitely  many  times.  Any  application  of  the  rule  5  reduces  the  set  of 
variables  occurring  in  F  D  A,  thus  any  path  in  a  saturation  tree  is  finite  and  the  tree 
itself  is  finite.  ■ 

Therefore  the  saturation  process  always  terminates  and  computes  some  saturation 
tree  of  a  given  sequent.  We  say  that  the  saturation  process  succeeds  if  it  produces  a 
saturation  tree  with  all  leafs  labelled  with  axioms;  otherwise  it  fails. 

4.11  Lemma.  If  the  saturation  process  on  a  given  sequent  succeeds,  then  the  sequent 
is  provable  in  CTZq. 

Proof.  A  saturation  tree  with  all  leafs  labelled  by  axioms  is  in  fact  the  tree-like 
derivation  in  CR.q  of  the  sequent  labeling  the  root.  ■ 

Suppose  the  saturation  process  fails  on  a  sequent  Fq  D  Aq.  Then  it  produces  a 
leaf  of  the  saturation  tree  labelled  by  (F  D  A,  <t)  such  that 

•  Fo(T  C  F,  Ao<t  C  a,  F  n  a  =  0,  T  €  F,  Con  C  F; 

•  if  (A  A  B)  €  F,  then  A  €  F  and  R  €  F; 
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•  if  (A  A  B)  e  A,  then  A  €  A  or  5  G  A; 

•  if  -lA  G  r,  then  A  G  A;  if-^A  G  A,  then  A  G  F; 

•  if  [pJA  G  r,  then  A  G  F; 

•  F  is  functional:  \p\A  G  F  and  \p\B  G  F  imply  A=  B. 


Now  we  are  ready  to  define  a  reference  structure  5ft  and  an  interpretation  *  which 
will  eventually  become  a  countermodel  for  Fo  D  Aq. 


Let  M  be  the  set  of  cell  variables  occurring  in  Fo  D  Aq,  which  are  fixed  points  of 
cr,  i.e. 

M  =  {m  G  CVar  \  mcr  —  m}. 

In  particular,  all  cell  variables  occurring  in  F  D  A  are  in  M.  Let  the  set  of  data 
constants  C  be  Con  U  D,  where  D  is  &  set  of  ’’new  constants”  corresponding  to  fixed 
point  reference  variables: 

D  =  {d{m)  I  mcr  =  m}. 

In  particular,  every  reference  variable  m  occurring  in  F  D  A  received  a  corresponding 
constant  d(m). 

It  is  clear,  that  the  fixed  point  reference  variables  remain  sort  of  parameters  of  the 
future  reference  structure  and  they  can  be  evaluated  in  either  way.  However  the  rules 
of  the  game  require  them  to  become  ground  sentences.  The  easiest  way  to  ensure  it 
is  to  introduce  special  new  constants  to  evaluate  these  variables. 

Now  the  set  of  Fm{M,  C)  is  defined. 

Put  for  any  fixed  point  reference  variable  m 


A(m) 


d{m),  if  m  G  F, 
->d{Tn),  otherwise. 


Now  for  any  subformula  A  of  F  D  A  we  define  a  ground  A^  G  Fm{M,  C):  cf  =  Ci-, 
=  T,  =  A(m),  ([m|J5)^  =  [m]5^.  Note  that  the  translation  ^  is  injective 
since  no  constants  d{m)  are  unified  for  different  m  e  M.  So,  we  will  write  B\  instead 
of  B^  understanding  A  as  the  substitution  {rn^lX{Tr^),rn^fX{m^) . . .}.  Put 


9  =  crA, 
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and  let 


R  =  {(m,  A)  I  [m]A  G  F}. 

We  have  to  establish  now  that  9?  =  (R,  9)  is  a  reference  structure.  First,  6  is  clearly  a 
ground  substitution.  Then  it  is  easy  to  see  that  9  unifies  R.  Indeed,  let  (m.  A)  €  R, 
then  [m|A  G  F,  and,  by  the  saturation  construction  mcr  =  Aa.  Thus 


m9  =  mcr\  =  AcrX  =  A9. 


4.12  Lemma. 

A  G  F  ^  g  A9, 
A  G  A 


Proof.  Induction  on  A.  The  cases  A  is  a  constant  c,-,  T  as  well  as  the  case  A  is  an 
reference  variable  m  are  covered  by  the  definitions  of  F  D  A  and  9.  The  cases  of 
boolean  connectives  are  trivial  by  the  construction  of  the  saturated  sequent  F  D  A. 
Now 

IpijB  G  F  {m,B)  G  i?  m9  =  B9  \pi\B9. 

Let  now  fmjB  G  A.  If  m  ^  Dom{R),  then  clearly  \pi\B9.  Let  now  m  6 
Dom{R),  i.e.  \p{\B'  G  F  for  some  B\  then  m9  =  B’9.  If  B9  =  B'9,  then  Bcr  =  B'a, 
since  A  is  injective,  and  thus  B  =  Bcr  =  B'cr  =  B\  which  is  impossible  because 

FnA  =  0.  ■ 

4.13  Lemma.  9?  =  {R.,9)  is  a  reference  structure. 

Proof.  It  only  remains  to  check  that  rn9  is  valid  on  Dom(R).  Let  m  €  Dom{R), 
then  (m,  A9)  G  R9  and  there  exists  some  B  such  that  [m]5  G  F  and  B9  =  A9.  Then, 
by  the  saturation  property,  B  G  F,  thus,  by  lemma  4.12  g  B9,  i.e.  A9.  ■ 

Now  we  define  an  interpretation  *  by  m*  =  m,(T,  and  thus  A*  =  Act  for  any  L 
formula  A.  Note  that  Acr  is  simultaneously  a  formula  of  the  reference  structure  9?. 

It  is  almost  trivial  now  that  9?  ^  (A  Fq  — >•  V  Ao)*.  Indeed,  if  A  G  Fq,  then  Act  E  F, 
and  9?  ^  A<t^,  by  lemma  4.12,  and  9?  ^  A*.  Similarly,  if  A  G  Ao,  then  91?  ^  A*. 

Thus  we  have  established  the  following;  for  any  sequent  F  D  A  in  the  language  L 
£71gI/FdA  9?^(AF-4VA)*  £7^I//\F->VA  £7eGl/FDA, 
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which  together  with  the  trivial 


gives 

4.14  Corollary.  (Cut  elimination  for  CTZa)  CIZq  =  C'Rq 

4.15  Corollary.  CTZa  is  an  adequate  Gentzen  style  formulation  of  CR,. 

4.16  Corollary.  C7l  is  decidable. 

Let  us  complete  the  proof  of  Theorem  4.2.  Fix  a  formula  F  satisfying  the  condi¬ 
tions  of  the  Theorem  4.2.  Put  To  =  0  and  Aq  =  {i^}.  Since  CRq  1/  Fq  D  Aq  the 
saturation  process  on  the  sequent  To  D  Aq  fails,  and  there  is  a  reference  structure  3? 
such  that  3?  A  To  — V  Ao,  i.e.  3?  ^  F.  ■ 

A  lazy  inspection  of  the  completeness  proof  above  demonstrates  that  the  size  of 
a  countermodel  (in  a  dags  form)  of  a  given  L  formula  A  can  be  made  less  than  cZ"*, 
where  I  is  the  length  of  A,  and  c  fixed. 

Also,  on  the  basis  of  lineartime  unification  algorithms  from  [5],  [7]  one  can  easily 
proof  the  following  time  complexity  bounds  for  some  natural  problems  in  reference 
structures. 

4.17  Theorem. 

1.  The  problem  "whether  3i  =  (/?,  0)  is  a  reference  structure”  is  polytime. 

2.  The  satisfiability  problem  for  the  language  L  is  NP -complete. 


5  Reference  structures  building  and  optimization. 

The  language  L  can  now  be  considered  as  a  programming  language  for  designing 
reference  structures  with  reading  procedures.  A  program  here  is  a  labelled  modal 
formula  P  describing  the  properties  of  some  reference  structure  3?  .  The  satisfiability 
algorithm  extracted  from  the  proofs  of  the  Theorem  4.2  checks  whether  P  is  satisfiable 
and  constructs  a  finite  model  of  P,  which  is  a  desired  reference  structure. 
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We  reduce  the  problem  of  constructing  a  model  of  P  to  the  problem  of  construct¬ 
ing  a  countermodel  for  the  sequent  P  D.  The  saturation  algorithm  checks  whether 
this  sequent  is  provable  and  transforms  it  into  the  sequent  F  D  A  with  saturation 
properties.  If  saturation  succeeds,  then  CTZ  h  -'P,  and  thus  there  is  no  reference 
structure  satisfying  the  condition  P.  If  saturation  fails,  then  we  have  a  quadratic  of 
the  size  of  P  reference  structure  5ft  and  an  interpretation  *  such  that  P*  is  valid  in  5ft. 

Let  us  consider  an  example  of  a  problem  “initialization  of  typed  variables” ,  which 
comes  from  some  common  programming  languages  like  PASCAL,  C,  etc. 

5.1  Example.  (Initialization  of  typed  variables).  We  consider  the  following  variant 
of  commonly  used  typing  system.  Let  T  be  a  finite  set  of  primitive  types  with  domains 
Dr,  r  £  T  (the  domains  are  supposed  to  be  decidable  but  not  necessary  disjoint). 
The  set  of  all  types  Type  is  constructed  from  T  by  the  rules: 


Rule  : 

Domain  : 

(Structure) 

Ti  €  Type,  \  <i  <n 
{Ti,...,r„}  G  Type 

Di  X  ...X  Dn 

(Union) 

Ti  €  Type,  1  <  i  <  n 

G  Type 

L>i  U  . . .  U 

(Subset) 

T,  G  Type,  a,-  G  L>t.  ,  1  <  i  <  n 

T {Ul )  •  •  •  5  Un} 

Setj)f{ai\Ti,...,an\Tn)  eType 

The  initialization  problem:  given  a  type  t  ^  T ype  and  an  object  a  G  UrsTj/pe 
we  have  to  check  whether  a  E  Dr  and,  if  it  is,  to  build  a  data  structure  which  stores  a 
as  an  object  of  type  r  together  with  some  address  which  is  the  value  of  corresponding 
pointer. 

The  basic  elements  to  construct  a  reference  structure  from  are  constants  for  objects 
of  primitive  types.  A  reference  structure  is  supposed  to  represent  the  type  structure 
in  a  way  that  provides  a  direct  access  to  any  subobject  of  a  given  object. 

With  the  pair  (r,  a)  we  associate  a  formula  $(t,  a)  G  L  and  a  cell  variable  p  in  it: 
{Primitive  type):  t  and  a  €  Dr-  Then 

$(r,  a)  =  |[p]Cr,a, 
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where  Cr,o  is  a  data  constant. 


(Structure):  r  =  {Ti,...,r„}  and  a  =  (ai,...,a„).  Then 

n 

«)  =  ( A  ^0  A  IpKpi  A  . . .  A  p„), 

»=i 

where  $,■  is  a  variant  of  $(t,,  a,)  obtained  from  it  by  renaming  the  variables  of 
the  form  q,  q  (so  $,•  and  for  i  ^  j  do  not  have  common  variables)  and  p,-  is 
the  associated  cell  variable. 

( Union):  r  =  {ti;  . . . ;  r„}.  Then 


$(r,a)  =  (V  $'•)  A  |[p]g, 

t=i 

where  =  ^i[qlpi\i  q  is  a,  new  variable  and  $,•  is  obtained  from  $(r,',a)  in  the 
same  way  as  for  (Structure). 

(Subset):  r  =  Setjof(ai\Ti, . . . ,  a„|r„)  and  a  C  {ai, . . . ,  Cn}.  Then 

$(r,  a)  =  (  /\  -np.)  A  (  /\  $i)  A  |[pl(“'lbolT  <->  pi  V  . . .  V  p„), 

a,-^a  aiEa 

where  po  is  a  new  cell  variable  and  p,-,  (1  <  i  <  n)  are  the  same  as  for 
(Structure).  Here  the  formula  IpojT  indicates  whether  the  object  a  is  empty. 

(Type  mismatch):  In  all  other  cases  ^(t, a)  =  [p]_L. 

In  all  cases  the  associated  cell  variable  is  p. 

It  is  easy  to  see  that  $(r,  a)  is  satisfiable  iff  a  G  The  satisfiability  algorithm 
transfers  it  into  a  data  structure  implementing  the  initialization 

V  :=  a 

for  a  variable  v  of  type  r.  The  interpretation  p*  of  the  associated  cell  variable  is 
an  address  sufficient  to  restore  all  the  information  about  the  value  of  u  as  an  object 
of  type  r.  It  is  a  natural  pointer  value.  Specific  features  of  the  implementation  are 
reflected  in  $(r,  a);  it  plays  a  role  of  a  program  for  building  this  data  structure.  The 
examples  of  resulting  data  structures  are  shown  on  Fig.l. 


16 


Structure 


Union 


r  =  {ri,T2},  a  =  (01,02). 
[Piloi  A  Ip2}a2  A  [pK^  A 


r  =  {ti,  T2},  a  e  DriU  D 
(Mai  V  M«2)  a  IpI? 

p- 1  q  I 


Subset 

T  =  Set.of{ai  :  ri,02  :  r2,03  :  rs),  o  =  {02,03}. 
A  I>2la2  A  |[p3la3  A  |[pI(“'bolT  ^  ^  V  ^  V 


bojT  -H-  Pi 


I 


T  =  Setjof{ai  :  ri,02  :  r2,03  :  T3),  o  =  0. 

^  A  A  A  |[pl(“'bolT  ^  ^  V  ^  V 


boJT  ^  Pi  V  P2  V  P3 


Po  : 
T 


Note  that  we  have  chosen  the  variant  of  the  program  $(r,  a)  where  all  possible  sim¬ 
plifications  are  already  done.  This  job  can  be  left  to  the  satisfiability  algorithm  too. 
For  example  in  the  case  of  (Structure)  when  r  =  {ri, . . . ,  t„}  and  a  =  (ci, . . . ,  Um) 
we  may  take  the  following  variant: 


MiPi  A  . . .  A  p„)  A  (/\  $,)  A  [pKpi  A  . . .  A  p„,). 

1=1 

It  is  equivalent  to  $(r,  a)  and  the  algorithm  transfers  it  into  the  same  data  structure. 

In  order  to  construct  a  reference  structure  which  uses  only  one  cell  instead  of  many 
containing  the  same  record,  i.e.  to  construct  a  reference  structure  with  a  functional 
conversion  of  the  ground  storage  relation  {R0)~^  or,  even  more,  with  invertible  reading 
procedure  6,  we  introduce  the  logics  ClZi  and  CTZi-i.  The  logic  CTZi  is  CR,  +  (A4) 
where  (A4)  is  the  following  axiom  scheme: 

(A4)  [pM  a  yjA  ^  (5  <4  B\p'/p]). 


CTli-i  is  the  modification  of  CTZ  where  the  “conditional”  equality 


p=q^p=q 


is  replaced  by 


p  =  q^p  =  q. 


5.2  Theorem.  For  any  labelled  modal  formula  F 


1.  CKi  F  iff  F*  is  valid  for  all  interpretations  *  in  finite  reference  structures 
with  functional  relation  (R6)~^ ; 

2.  CTZi-i  A  iff  A*  is  valid  for  all  interpretations  *  in  finite  reference  structures 
with  invertible  reading  procedure  6. 


Proof.  Similar  to  the  proof  of  the  completeness  Theorem  4.2.  ■ 

The  logics  CRi  and  CR-i-i  are  also  decidable.  The  satisfiability  algorithms  from 
the  completeness  proofs  for  these  logics  can  be  used  in  the  same  way  as  that  for  CTt 
to  construct  reference  structures  without  double  stored  sentences.  The  complexity 
bounds  from  Theorem  4.17  are  also  preserved. 
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